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ABSTRACT  A  phylogeny  was  reconstructed  for  four  species  belonging  to  the  Neotropical  Anopfteles 
( Nyssorhynchus )  albitarsis  complex  using  partial  sequences  from  the  mitochondrial  cytochrome 
oxidase  I  ( COI )  and  NADU  dehydrogenase  4  ( ND4 )  genes  and  the  ribosomal  DNA  ITS2  and  D2 
expansion  region  of  the  28S  subunit.  The  basis  for  initial  characterization  of  each  member  of  the 
complex  wits  by  correlated  random  amplification  of  polymorphic  DNA-polymerase  chain  reaction 
(RAPD-PCR)  markers.  Analyses  were  carried  out  with  and  without  an  outgroup  (An.  ( Ntjs. )  argy- 
ritarsis  Robineau-Desvoidy)  by  using  maximum  parsimony,  maximum  likelihood,  and  Bayesian  meth¬ 
ods.  A  total  evidence  approach  without  the  outgroup,  using  separate  models  for  “fast"  ( COI  and  ND4 
position  3)  and  “slow"  (rDNA  ITS2  and  D2,  and  COI  and  ND4  position  l)  partitions,  gave  the  best 
supported  topology,  showing  close  relationships  of  An.  albitarsis  Lynch-Arribalzaga  to  An.  albitarsis 
B  and  An.  marajoara  Galvao  &  Dainasceno  to  An.  deaneorutn  Rosa-Freitas.  Analyses  with  the  outgroup 
included  showed  poorer  support,  possibly  because  of  a  long  branch  attraction  clTcct  caused  by  a 
divergent  outgroup,  which  caused  one  of  the  An.  marajoara  specimens  to  cluster  with  An.  deaneorutn 
in  some  analyses.  The  relationship  of  the  above-mentioned  result  to  a  separately  proposed  hypothesis 
suggesting  a  fifth  species  in  the  complex  is  discussed. 
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Anopheles  ( Nyssorhynchus )  marajoara  Galvao  & 
Dainasceno,  a  member  of  the  Albitarsis  Complex,  was 
recently  recognized  as  the  primary  vector  of  malaria 
parasites  in  northeastern  Amazonia,  Brazil  (Conn  et 
al.  2002).  Other  species  in  the  complex  are  An.  albi¬ 
tarsis  Lynch-Arribalzaga,  An.  deaneorutn  Rosa-Freitas, 
and  an  unnamed  species  “B"  (Kreutzer  et  al.  1976; 
Linthicum  1988;  Narang  et  al.  1993;  Rosa-Freitas  and 
Deane  1989;  Wilkerson  et  al.  1995a,  b).  From  studies 
carried  out  in  Rondonia  Slate,  Brazil,  there  is  also 
evidence  to  support  the  importance  of  An.  deaneorunt 
as  an  important  malaria  vector  (Klein  et  al.  1991a,  b). 
Wilkerson  el  al.  (1995a,  b)  separated  the  four  largely 
isomorphic  species  by  using  species-specific  random 
amplified  polymorphic  DNA  (RAPD-PCR)  markers. 
Their  analysis  relied  on  an  empirical  assumption  of 
multiple  correlated  “fixed”  markers  to  hypothesize 
reproductive  isolation,  similar  to  the  use  of  correlated 
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morphological  characters  for  the  same  ends,  but  they 
did  not  address  confounding  factors  inherent  in  RAPD 
markers  such  as  possible  nonhomology  of  comigrating 
bands  or  linkage  of  markers.  Lehr  et  al.  (2005),  based 
on  complete  mitochondrial  DNA  cytochrome  oxidase 
I  (mtDNACO/)  sequence,  question  the  validity  of  this 
approach,  suggest  a  fifth  species,  and  present  a  COI 
gene  tree  for  all  five.  Using  specimens  from  the  Wilk¬ 
erson  et  al.  (1995a,  b)  RAPD-PCR  studies,  Merritt  el 
al.  (2005)  analyzed  a  portion  of  the  white  gene  that 
contains  the  white  fourth  intron.  They  found  that  the 
intron  was  present  in  An.  marajoara  but  not  in  the 
other  three  species  (also  noted  by  Krzywinski  et  al. 
(2001)  for  An.  albitarsis).  Phylogenetic  analysis  of 
coding  sequence  in  the  area  of  the  fourth  intron, 
correlated  with  intron  loss  hypotheses,  resulted  in 
strong  support  for  a  single  intron  loss  event  in  this 
species  complex  and  gave  a  topology  different  from 
that  found  both  by  Lehr  et  id.  (2005)  and  from  that 
presented  here. 

Anopheles  subgenus  Nyssorhynchus  includes  33  spe¬ 
cies  (Ilarbach  2004),  including  the  two  most  impor¬ 
tant  vectors  in  the  New  World  tropics:  An.  darling i 
Root  (Linthicum  1988)  and  An.  albimantis  Wiede¬ 
mann  (Faran  1980) .  The  subgenus  is  divided  into  three 
sections  based  on  morphological  characters:  the 
Argyritarsis  Section,  which  includes  the  Albitarsis 
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Table  1.  List  of  species  and  source  uf  upecimena  used  in  this  study 


Species  i 

Tree 

reference 

no 

Collection  no. 

Locality  of  specimen 

Coordinates 

Gen  Bank  accession  no. 

(AY  prefix) 

Ingroup 

ITS2 

D2 

SD4 

COI 

An.  aUAtanvt 

A2 

BR5I1  (3) 

Parana,  near  Cuaint 

24°  OT  S54°  15'  W 

828322 

846323 

846340 

846507 

An.  rtZ/n'torsw 

A3 

BR501(38) 

Sao  Paulo,  6  km  SW  Reeistro 

24°  36.8’  S  47°  53. 1  *  W 

828322 

846324 

846341 

846308 

An.  fliJn'torsis 

A-l 

BR500(16) 

Sao  Paulo.  6  km  SW  Ret^stro 

•24°  36.8'  S  47°  53.1'  W 

828322 

846325 

846342 

846309 

An.  albitarsis 

B5 

BR017(4) 

Bahia.  Ilutjuani 

13°  26' S  39*56' W 

828321 

846326 

846343 

846310 

An.  aMii  tarsi* 

B6 

BR/niO015(l) 

Bahia,  Itaquara 

I3°26'S39“56'W 

828324 

846327 

846341 

846311 

An.  albitarsis 

BY 

BR500(45) 

Siio  Paulo,  6  km  SW  Registro 

24°  36.8'  S  47"  53. 1 '  W 

828321 

846328 

846345 

846312 

An.  albitarsis 

B8 

BR/RIO015<3) 

Bahia.  Itaquaru 

13°  26'  5,39*56'  VV 

828324 

846329 

846346 

846313 

An.  marajoara 

C9 

BROWS) 

Mato  Grosso,  Peixoto  de 

l(r*23,S54°54'VV 

828339 

846330 

846341 

816314 

Azevedo 

An.  uwrajoara 

CIO 

BR026U2) 

Amazonas,  Manaus 

2°  53’ S  60“  15' W 

828.339 

816331 

846348 

846315 

An.  marajoara 

Cll 

BR544(83) 

Mato  Crosso,  Peixoto  de 

Iff  23'  S  54°  54'  W 

828339 

816332 

846349 

846316 

Azevedo 

An.  tnarajoara 

C12 

BR/R001<3) 

Pant  Illia  de  Marajrt 

l°00'S4ff30'W 

82S328 

846333 

846350 

846317 

An.  dt'aneojiini 

D15 

BR/RIO007(ll) 

Ronddnia,  Guajara-Mirim 

10* 50'S 65° 20'  W 

828332 

846331 

846351 

846318 

An.  deaneorum 

DI6 

BR700(6) 

Ronddnia.  Ariquemes 

9*  56'  S  63*  OT  W 

828330 

846335 

846352 

846319 

An.  {/emuunmi 

D17 

BR/ RJO007 ( 17). Dl 7  Ronddnia,  Guajani- Mirim 

10f*  50'  S  65*  20’  W 

828330 

816336 

846353 

846320 

Outgroup 

An.  tarns 

AnarsyI8 

V7.31.100 

Venezuela.  Merida,  Rd  to  Jaji 

T  37.31 '  N  72°  25.92'  W 

819553 

816337 

846354 

646321 

Aa  argyritarsis 

Anargyl9  VZ11-6 

Venezuela.  Merida,  Rd  to  Jaji 

7°  37.31 'N  72°  25.92'  W 

849553 

846338 

846355 

846321 

An.  argyritarsis 

Anargy*20 

BRIO-112 

Ceara,  Ubajaru 

3°  53.27'  S  49*51.25'  W 

849554 

846339 

846356 

846322 

All  localities  are  in  Brazil  unless  otherwise  indicated. 


Complex  and  An.  darlingi;  the  Albimanus  Section, 
which  includes  An.  albimanus ;  and  the  little  known 
Myzorhynchella  Section  (Peyton  et  al.  1992).  Rela¬ 
tionships  among  the  33  included  species  are  not 
well  resolved,  but  it  is  known  that  the  Argyritarsis 
and  Albimanus  Sections  are  paraphyletic  relative  to 
each  other  because  one  putative  clade  contains  both 
An.  darlingi  and  An.  albimanus,  suggesting  a  possible 
evolutionary  link  to  vector  capacity  (Conn  1998, 
Sallum  et  al.  2000).  Cryptic  species  are  common  in 
An.  ( Nyssorhynchus )  and  in  Anopheles  in  general,  and 
most  groups  that  are  closely  studied  yield  new  taxa 
(Rosa-Freitas  et  al.  1998),  with  An.  darlingi  being  an 
apparent  exception  (Manguin  et  al.  1999).  Consider¬ 
ing  the  medical  importance  of  this  complex  and  our 
general  lack  of  knowledge  regarding  the  relation¬ 
ships  within  subgenus  Nyssorhtjnchm,  we  undertook 
this  study  to  corroborate  results  produced  by  RAPDs 
and  to  investigate  how  the  Albitarsis  Complex  species 
are  related  to  each  other.  We  report  here  a  molecular 
phylogenetic  analysis  of  the  four  species,  initially  sep¬ 
arated  by  RAPDs,  by  using  two  ribosomal  DNA 
(rDNA)  sequences,  internal  transcribed  spacer  two 
(ITS2)  and  the  D2  expansion  of  the  28S  subunit  (D2), 
and  partial  sequence  from  two  mitochondrial  genes, 
NADH  dehydrogenase  4  (ND4)  and  COI. 

Materials  and  Methods 

Source  and  Identification  of  Specimens.  Mor¬ 
phological  characters  from  Linthicum  (1988)  were 
used  for  identification  of  An.  ( Nys .)  albitarsis  s.l.  and 
An.  (iVi/s.)  argyritarsis  Robineau-Desvoidy,  the  out¬ 
group  species.  The  ingroup  specimens  also  were  used 
by  Wilkerson  et  al.  ( 1995a,  b)  for  their  studies  (Table 
1).  These  were  identified  to  species  using  species- 
specific  RAPD  markers  as  described  by  them.  For 


DNA  analysis,  we  used  individuals  from  progeny 
broods  preserved  in  100%  ethyl  alcohol  maintained  at 
-70°C.  A  portion  of  each  brood  was  retained  for  mor¬ 
phological  study  and  includes  individually  reared  pin¬ 
pointed  adults  with  associated  pupal  and  fourth  instar 
exuviae  that  are  deposited  in  the  Smithsonian  Insti¬ 
tution,  National  Museum  of  Natural  History  (NMNH) 
and  the  Faculdade  de  Saude  Publica,  Universidade  de 
Sao  Paulo  (FSP-USP).  DNA  voucher  specimens  are 
deposited  in  NMNH. 

Laboratory  Methods.  DNA  was  extracted  from  in¬ 
dividual  mosquitoes  of  each  species  as  described  in 
Wilkerson  et  al.  (1993).  Portions  of  the  mitochondrial 
COI  and  the  ND4  genes  and  the  nuclear  rDNA,  28S  D2 
expansion  region,  and  the  ITS2  were  amplified  and 
sequenced  for  at  least  three  individuals  of  each  spe¬ 
cies.  The  region  of  each  gene  that  was  sequenced, 
sequences,  and  positions  of  the  primers  used  in  this 
study  are  in  Table  2. 

PCR  reactions  were  carried  out  in  a  total  volume  of 
50/d  by  using  standard  protocols  (Palumbi  1996).  PCR 
temperature  profiles  to  obtain  the  above-mentioned 
sequence  were  initial  denaturation  at  95°C  for  3  min 
followed  by  40  cycles  of  denaturation  at  943C  for  40  s; 
annealing  at  56°C  for  40  s  (1TS2  and  COI)  or  52°C  (D2 
and  ND4 );  and  extension  at  72°C  for  1  min  and  final 
extension  at  72°C  for  10  min. 

For  sequencing,  PCR  products  were  purified  using 
polyethylene  glycol  (PEG)  precipitation  (20%  PEG 
8000  and  2.5  M  NaCl).  Sequencing  reactions  were 
carried  out  directly  on  both  strands  of  DNA  by  using 
ABI  Big  Dye  chemistry  (Applied  Biosystems,  Foster 
City,  CA),  and  the  sequences  were  generated  with  an 
ABI  377  automated  sequencer.  The  sequences  were 
analyzed  and  questionable  base  calls  resolved  using 
Sequencher  3.0  (Gene  Codes  Corp.,  Ann  Arbor,  MI). 
Sequences  were  initially  aligned  using  ClustalX,  ver- 
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Table  2.  Sequences  of  COI ,  iVW  /,  1)2,  mid  1TS2  primers  used  in  tin*  Mudy 


Designation 

Sequence  (5' *3') 

Position 

Reference 

Fly5IP(CO/) 

CGATTATTACCATTTATTGT 

842-861“ 

Sallum  et  al.  (2002) 

FlvlOlP(COf) 

CCAAATAATCAAATTGTTCT 

1.170-1392" 

Sallum  cl  al.  (2002) 

ND4F 

CCAGAACTAATAAAAATCACCAT 

This  study 

ND-m 

GCAGGAGTTTTATTAAAGTTAGG 

This  study 

D2F 

AGTCGTGTTGCTTCATAGTG 

288-307'' 

Sallum  et  al.  (2002) 

D2R 

CTTGGTCCGTGTTTCAACAC 

821-840'* 

Sallum  et  al.  (2002) 

1TS2F 

TGTGAACTGCAGGACACATGAA 

3'  end  5.8S 

Cornel  et  al  (1996) 

TTS2R 

ATGCTTAAATTTAGCCCCTAGTC 

5'  end  28S 

Cornel  et  al.  (1996) 

u  Nucleotide  position  relative  to  COI  and  COIl  sequence  of  An.  qiiadriumcufutus  (NC000vS75).  except  for  Fly  10  whose  position  is  relative 
to  COI  gene  in  Drosophila  yakuha. 

'  Nucleotide  position  relative  to  28S  sequence  of  An. 


sion  1.8  (Thompson  et  al.  1997)  and  then  compared 
and  visually  aligned  using  Se-Al  version  2.0a9  (Se¬ 
quence  Alignment  Editor,  A.  Rambaut,  University  of 
Oxford)  or  MacClade  (Maddison  and  Maddison 
2000).  GenBank  accession  numbers  for  all  sequences 
are  in  Table  1. 

Phylogenetic  Analyses.  Unweighted  parsimony 
analyses  were  done  using  PAUP  4.0bl0  (Swofford 
2004)  by  using  the  heuristic  search  option  with  TBR 
branch-swapping  with  1000  random-taxon-addition 
replicates.  Parsimony  bootstrapping  (Felsenstein 
1985)  used  1000  pseudoreplicates,  with  10  random- 
taxon-addition  replicates  per  pseudoreplicate.  Parsi¬ 
mony-uninformative  characters  and  the  hyper¬ 
variable  sites  were  excluded  from  all  the  analyses. 

For  maximum  likelihood  (ML),  we  used  PAUP 
4.0bl0  (Swofford  2004).  Starting  models  were  chosen 
with  ModelTest  (Posada  and  Crandall  1998)  by  using 
the  Akaike  Information  Criterion  (AIC).  The  result¬ 
ing  tree  was  saved  and  also  used  to  test  site-specific 
models.  Some  maximum  likelihood  determinations 
and  data  manipulations  were  done  using  p4  (Foster 
2004).  Maximum  likelihood  searches  started  with  a 
neighbor-joining  tree,  on  which  we  then  optimized 
parameters  and  fixed  the  values  for  those  parameters 
for  branch  swapping  on  that  tree.  Several  ML  search 
rounds  were  carried  out  until  the  parameters  were 
fully  optimized. 

The  program  p4  (Foster  2004)  was  used  to  bootstrap 
the  data,  which  allowed  bootstrapping  of  partitioned 
data  under  a  site-specific  model.  A  consensus  of  the 
trees  from  200  bootstrap  replicates  was  made  in  PAUP. 
Branch  lengths  of  that  consensus  tree  were  optimized 


using  the  search  model  and  the  original  data,  and 
bootstrap  support  values  from  the  PAUP  tree  bipar¬ 
titions  table  were  placed  on  the  tree  using  p4. 

For  Bayesian  analysis,  we  used  the  program  MrBayes 
(Huelsenbeck  and  Ronquist  2001).  Where  possible, 
the  same  model  used  for  ML  analysis  was  used  in 
Bayesian  analysis,  however  when  that  model  was  not 
implemented  in  MrBayes  then  the  next  more  complex 
model  available  in  MrBayes  was  used.  When  a  site- 
specific  model  was  used,  we  used  site  rates,  rate  ma¬ 
trices,  compositions,  and  among-site  rate  variation 
specific  to  each  partition.  Markov  chain  Monte  Carlo 
(MCMC)  runs  were  500,000  generations  long,  sam¬ 
pling  every  250  generations,  for  a  total  of  2,001  sam¬ 
ples.  Of  these,  the  first  1,001  sample  were  discarded  as 
bum-in,  which  is  well  past  the  point  where  the  like¬ 
lihood  plot  reached  a  plateau. 

To  estimate  likely  placements  of  the  root  for  the 
four  taxa,  we  used  p4  to  compile  the  sampled  MCMC 
trees  with  outgroup  attachment  information  pre¬ 
served  to  show  the  distribution  of  root  positions. 

Results 

In  total,  1,846  sites  were  included  in  the  analysis 
(Table  3).  Because  the  outgroup  was  very  divergent, 
we  suspected  that  it  might  cause  us  to  choose  inap¬ 
propriate  models  or  be  responsible  for  long  branch 
effects.  To  test  this,  the  analyses  were  conducted  with 
and  without  the  outgroup.  To  better  fit  models  to  the 
data,  we  reasoned  that  the  available  genes  could  be 
separated  into  two  groupings  based  on  apparent  rel¬ 
ative  substitution  rates,  “fast"  and  "slow'’,  with  the 


Table  3.  CuitHimil,  variable,  and  purnimoiiy  infonnulive  siifs  in  the  ingroup  only  and  with  the  outgroup  included 


Partition 

nSitc 

nConstSite 

nVariableSite 

nParslnformativeSite 

with 

without 

with 

without 

with 

without 

with 

without 

ITS24 

473 

473 

449 

463 

59 

10 

53 

7 

D2 

550 

559 

544 

553 

15 

fi 

9 

4 

CO/Posl 

142 

142 

135 

137 

7 

5 

6 

4 

CO/Posl 

142 

142 

141 

142 

1 

0 

1 

0 

C.OfPos.3 

143 

143 

78 

103 

65 

40 

51 

24 

■VWPosl 

129 

129 

121 

124 

8 

5 

3 

o 

.\7MPos2 

129 

129 

128 

129 

1 

0 

1 

0 

■\7MPos3 

129 

129 

64 

SO 

65 

49 

55 

38 

Hypervariablc  regions  excluded;  includes  91  bases  in  flanking  regions. 


November  2005 


WlLKKRSON  CT  AL.:  PHYLOGENETICS  OF  NEOTROPICAL  AliO))heles 


921 


Ttibli*  4.  ML  lionatrap  suppnrl  mill  poslrriur  prolmliility  (PP)  fur  relaliotti<lli[M  within  tin-  An.  albitania  complex,  when  the  outgroup 
both  included  and  excluded 


Phylogenetic  group 

COI  +  SIM  pos  3 

ITS2  +  D2+ 

( COI  +  ND4  pos  1) 

rDNA  +  mtDNA 

with 

without 

with 

without 

with 

without 

ML 

PP 

ML 

PP 

ML 

PP 

ML 

PP 

ML 

PP 

ML 

PP 

An.  albitarsis 

0.69 

0.93 

0.96 

1.00 

0.53 

0.80 

0.76 

0.65 

0.92 

0.89 

1.00 

1.00 

An.  albitarsis  11 

0.88 

0.93 

0.99 

1. 00 

0.77 

0.81 

0.88 

0.98 

0.99 

0.96 

LOO 

1.00 

An.  deaneorum 

0.88 

0.91 

0.81 

0.80 

0.83 

0.95 

0.83 

0.76 

An  marajoara 

0.62 

0.78 

0.91 

0.97 

0.52 

0.61 

0.99 

1.00 

An.  marajoara  (C9,  CU,  C12) 

0.89 

1.00 

0.86 

1.00 

0.58 

0.94 

0.69 

0.93 

An.  deaneorum  (D15,  DIG) 

0.31 

0.63 

0.4S 

0.68 

0.45 

0.73 

0.69 

0.65 

0.67 

(An  marajoara,  Aw.  deaneortmi) 

0.51 

0.59 

0.99 

1.00 

0.25 

0.52 

0.57 

0.63 

0.66 

0.63 

1.00 

LOO 

(An.  atbilanis,  An.  albitarsis  B) 

0.51 

0.59 

0.99 

1.00 

0.25 

0.40 

0.76 

0.63 

0.66 

1.00 

1.00 

(An.  marajoara,  (CIO,  DI7)) 

0.32 

0.37 

0.52 

0.65 

(An.  marajoara  (CIO),  An.  deaiworum  (DI7) 

0,25 

0.45 

0.63 

0.86 

mtDNA  position  3  partition  (fast)  in  one  group  and 
tiie  remaining,  mtDNA  position  1  and  rDNA  (ITS2, 
D2),  in  the  other  group  (slow).  The  data  were  not 
subdivided  further  because  of  lack  of  variation.  The 
data  groupings  as  described  above  as  well  as  the  num¬ 
ber  of  parsimony  informative  characters  are  given  in 
Table  3. 

Phylogenetic  Analysis  with  the  Outgroup  Excluded. 
A  summary  of  branch  supports  for  ML  and  Bayesian 
analysis  for  fast  partition,  slow  partition,  and  com¬ 
bined  data  are  given  in  Table  4.  The  best  resolution 
was  obtained  with  combined  data  (Fig.  1). 

Brief  mention  will  first  be  made  here  of  the  separate 
analyses  of  fast  (mtDNA  position  3)  mid  slow 
(mtDNA  position  1  plus  rDNA)  partitions  (Table  4). 
Using  the  mtDNA  position  3  partition  (fast)  only, 
the  topologies  of  the  ML  and  Bayesian  analyses  were 
the  same,  and  showed  resolution  of  An.  albitarsis  and 
An.  albitarsis  B  as  separate  groups,  and  good  resolution 
of  the  (An.  deaneorum.  An.  marajoara)  group  from  the 
others  (designated  “A,”  "B,"  “D,”  and  "C,"  respectively 
in  Figs.  1  and  2) .  However,  sequences  of  An.  marajoara 
and  An.  deaneorum  were  not  recovered  into  two  sep¬ 
arate  nonexclusive  clades  (de  Queiroz  1998)  because 
An.  marajoara  (CIO)  and  An.  deaneorum  (D17)  clus¬ 
tered  together  in  a  poorly  supported  clade  that  was 
closer  to  An.  marajoara  than  to  An.  deaneorum.  Using 
the  slow  partition  by  itself  (mtDNA  position  1  and 
rDNA),  there  was  more  ambiguity,  and  the  topology 
for  the  ML  analysis  differed  from  the  Bayesian  anal¬ 
ysis.  Support  for  separate  groups  was  generally  poor, 
with  the  highest  support  for  the  branch  separating  the 
entire  An.  marajoara  sequence-group  from  the  rest  of 
the  tree. 

Parsimony  analysis  of  the  combined  rDNA  and 
mtDNA  data  sets  generated  six  most  passinionious 
trees  (MPTs)  (not  shown).  The  strict  consensus  tree 
generated  from  those  six  MPTs  recovered  three  well- 
supported  groups:  An.  albitarsis.  An.  albitarsis  B,  and 
(An.  marajoara  +  An.  deaneorum).  The  latter  also 
were  recovered  as  separate  groups  but  with  less  sup¬ 
port:  An.  deaneorum  with  72%  and  An.  marajoara  with 
91%  bootstrap  support. 

ML  and  Bayesian  analyses  were  carried  out  using 
the  mtDNA  position  3  in  1  partition,  and  mtDNA 


position  1  plus  rDNA  data  in  another  partition.  For  the 
unpartitioned  data,  ModelTest  suggested  the  TVM  + 
I  model;  however,  the  TVM  +  SS,  a  site-specific  model 
bused  on  the  two  partitions,  gave  a  better  likelihood, 
showing  an  increase  of  87.6  log  units,  so  the  TVM  +  SS 
model  was  used  in  all  ML  analyses.  The  ML  topology, 
including  the  nonparametric  bootstrap  support  values, 
is  shown  in  Fig.  1.  For  Bayesian  analysis,  the  data  were 
partitioned  in  the  same  way.  The  general  time-revers- 


Fig.  1.  Results  of  combined  data.  The  data  were  placed 
in  two  partitions  as  described  in  the  text,  consisting  of 
mtDNA  position  3  from  COI  and  ND4  in  partition  1,  and 
mtDNA  position  1  and  ribosomal  sequences  D2  and  ITS2  in 
the  other.  The  maximum  likelihood  bootstrap  tree  and  the 
Bayesian  consensus  tree  had  identical  topologies,  as  shown, 
with  support  (ML  bootstrap/ Bayesian  posterior  probability) 
shown  for  the  major  groups.  The  ML  analysis  used  the  TVM 
+  SS  model  in  PAlfP.  The  Bayesian  analysis  used  a  site- 
specific  model,  where  a  GTR  +  G  model  was  applied  to  the 
position  3  mtDNA  data  partition  and  GTR  +  plnvar  model  to 
position  1  mtDNA  data  plus  rDNA  data  partition.  A,  An. 
albitarsis;  B,  An.  albitarsis  B;  C,  An.  marajoara;  D,  An.  deaneo- 
rmn. 
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Fig.  2.  Outgroup  attachment  distribution.  From  post- 
bum-in  samples  from  two  MCMC  runs  using  the  combined 
data  including  the  outgroup,  a  consensus  tree  was  made  such 
that  the  position  of  the  attachment  point  of  the  outgroup  was 
preserved.  Of  the  2000  samples,  most  (677)  had  the  outgroup 
attach  as  shown,  at  the  base  of  the  An.  albitarsis  B  clade. 
However,  during  the  MCMC  the  outgroup  attachment  po¬ 
sition  varied  widely,  spending  its  time  on  the  branches  in 
proportion  to  the  numbers  shown.  A,  An.  albitarsis;  B,  An. 
albitarsis  B;  C,  An.  marajoara ;  D,  An.  deaneomm. 

ible  (GTR)  rate  matrix  and  composition  parameters 
were  unlinked  between  the  two  partitions.  The  ap¬ 
plicability  of  this  model  was  confirmed  by  ML  in  p4, 
which  showed  an  increase  of  137  log  units  by  allowing 
free  rates  and  compositions  in  the  two  partitions,  at  a 
cost  of  eight  parameters,  compared  with  having  the 
same  rate  matrix  and  composition  in  both  partitions. 
As  suggested  by  ModelTest,  gamma-distributed 
among-site  rate  variation  was  applied  to  the  mitochon¬ 
drial  position  3  partition,  and  a  plnvar  model  was 
applied  to  the  mitochondrial  plus  ribosomal  partitions. 
The  Bayesian  tree  has  the  same  topology  as  the  ML 
bootstrap  tree.  Bayesian  posterior  probabilities  are 
shown  in  Fig.  1.  Three  groups  (An.  albitarsis.  An. 
albitarsis  B,  and  An.  marajoara)  were  recovered  with 
high  support,  and  a  fourth  group,  An.  deaneomm,  was 
recovered  with  somewhat  less  support.  The  support 
for  the  split  between  (An.  albitarsis,  An.  albitarsus  B) 
and  (An.  marajoara,  An.  deaneorum)  was  also  high. 

Phylogenetic  Analysis  with  the  Outgroup  Included. 
Here,  we  note  results  for  the  combined  data  and  ad¬ 
dress  the  question  of  where  the  outgroup  attaches  to 
the  ingroup  (tree  topologies  not  shown).  Parsimony 
analysis  (not  shown)  of  the  combined  rDNA  and 
mtDNA  data  sets  generated  12  MPTs.  The  strict  con¬ 
sensus  tree  generated  from  those  12  MPTs  recov¬ 
ered  one  major  group  consisting  of  (An.  albitarsis  + 
An.  albitarsis  B) ,  and  two  minor  groups,  one  formed  by 
An.  deaneomm  and  the  other  by  An.  marajoara  except 
An.  marajoara  (CIO).  The  relationship  between 


An.  marajoara  and  An.  deaneomm  is  very  poorly  sup¬ 
ported  (<49%),  as  is  the  group  consisting  of  An.  ma¬ 
rajoara  ( <33%) .  Within  the  An.  marajoara  clade, 
An.  marajoara  (Cl  1 ) ,  An.  marajoara  (C12),  and 
An.  marajoara  (C9)  formed  abetter  supported  group 
(72%  bootstrap  value) .  Bootstrap  support  for  the  clade 
(An.  albitarsis.  An.  albitarsis  B)  is  moderate  (93%)  and 
for  An.  deaneomm  is  low  (81%). 

For  the  combined  data,  ModelTest  suggested  the 
GTR  +  I  +  G  model.  However,  the  GTR  +  SS  model, 
where  the  data  were  partitioned  into  mitochondrial 
position  3  partition,  and  mitochondrial  position  1  plus 
ribosomal  partitions,  had  a  better  likelihood,  with  an 
increase  of  86  log  units,  and  so  the  GTR  +  SS  model 
was  used  for  ML  analysis  in  PAUP.  Using  ML  two 
major  groups  were  recovered  ( (An.  deaneomm,  An. 
marajoara),  (An.  albitarsis.  An.  albitarsis  B)  out¬ 
group  |.  Support  for  placement  of  the  outgroup  is  low 
(51%),  but  the  groups  consisting  of  An.  albitarsis  and 
An.  albitarsis  B  are  moderately  and  strongly  supported 
(92  and  99%,  respectively)  (Table  4).  For  the  Bayesian 
analysis,  a  site-specific  model  was  used,  but  a  test  was 
made  in  p4  using  maximum  likelihood  to  determine 
whether  individual  rate  matrices  and  compositions  in 
the  two  partitions  were  better  than  using  a  single 
overall  rate  matrix  and  composition.  The  increase  in 
the  log  likelihood  owing  to  a  separate  rate  matrix  and 
composition  was  205,  which  is  highly  significant,  and 
so  this  strategy  was  used  in  the  Bayesian  analysis.  The 
settings  were  as  described  above  for  combined  data 
without  the  outgroup.  That  includes  using  gamma 
model  for  the  mitochondrial  position  3  partition,  and 
a  plnvar  model  for  the  ribosomal  partition.  The  sup¬ 
port  for  relationships  among  the  sequences  of  each 
ingroup  taxon  was  generally  lower  when  the  outgroup 
was  included  in  both  ML  and  Bayesian  analyses  than 
when  the  outgroup  was  excluded  (Table  4). 

To  estimate  the  distribution  of  likely  root  positions, 
the  combined  postburn-in  MCMC  samples  from  two 
runs  were  reanalyzed  to  obtain  a  consensus  tree  based 
on  retained  root  information  (Fig.  2).  The  largest 
number  of  input  trees  (667/2,000)  had  the  outgroup 
attached  on  the  branch  separating  the  An.  albitarsis  B 
group  from  the  rest  of  the  tree.  However,  many  other 
input  trees  had  the  outgroup  attached  on  the  branches 
leading  to  the  (An.  marajoara,  An.  deaneomm)  clade, 
and  many  other  input  trees  had  the  outgroup  attached 
on  the  branch  leading  to  the  An.  albitarsis  clade.  Also, 
many  trees  had  the  outgroup  attached  on  the  branches 
leading  to  sequences  CIO  (An.  marajoara)  and  A4  (An. 
albitarsis),  which  we  interpret  to  be  caused  by  long 
branch  effects  and  disregard.  Few  trees  had  the  root 
attached  along  the  branches  leading  to  either  the  An. 
marajoara  or  An.  deaneomm  clades  separately,  show¬ 
ing  little  evidence  for  placement  of  the  root  on  this 
part  of  the  tree.  However,  note  that  the  ML  bootstrap 
tree  has  this  root  placement  (not  shown).  Pending 
further  observations  we  conclude  that  the  rooting  is  as 
shown  or  on  the  branches  leading  to  the  (An.  mara¬ 
joara,  An.  deaneomm )  clade  or  to  the  An.  albitarsis 
clade.  Stated  another  way,  we  find  that  the  largest 
number  of  sampled  trees  had  the  outgroup  attach 
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between  An.  albitarsis  B  and  [An.  albitarsis  (An.  ma- 
r a  jour  a.  An.  deaneorum)  |,  but  with  many  trees  having 
the  outgroup  attach  at  the  base  of  the  An.  albitarsis 
clade,  making  [outgroup,  (An.  albitarsis,  (An.  albitar¬ 
sis  B,  (An.  marajoara.  An.  deaneorum)))  |  or  on  the 
branch  separating  An.  albitarsis  and  An.  albitarsis  B 
from  An.  marajoara  and  An.  deaneorum,  making  [out¬ 
group  ((An.  albitarsis,  An.  albitarsis  B),  (An.  mara- 
joara ,  An.  deaneorum)) ).  A  topology  with  An.  albitar¬ 
sis  B  basal  to  the  others  was  not  recovered  in  any  of  the 
other  parsimony,  ML,  or  Bayesian  consensus  trees. 

Discussion 

In  a  search  for  the  best  evolutionary  hypothesis  for 
the  Albitarsis  Complex,  we  used  partial  sequences  of 
two  mitochondrial  genes  (CO I  and  ND4),  and  two 
ribosomal  DNA  fragments  (ITS2  and  D2  expansion 
region  of  the  28S  subunit)  and  compared  maximum 
parsimony,  maximum  likelihood,  and  Bayesian  analy¬ 
ses  with  several  combinations  of  data  partitions.  Indi¬ 
vidual  genes  failed  to  give  well-resolved  trees,  possibly 
because  of  the  low  number  of  variable  sites.  Also,  to 
optimize  our  results  we  analyzed  different  data  par¬ 
tition  combinations,  finally  settling  on  two  partitions: 
mtDNA  position  3  alone  because  it  was  more  variable 
and  presumably  faster  evolving,  and  mtDNA  position 
1  plus  rDNA  because  they  were  less  variable  and 
presumably  slower  evolving.  In  addition,  because  the 
outgroup  was  highly  divergent,  we  tested  for  long 
branch  effects  by  carrying  out  all  analyses  with  and 
without  the  outgroup. 

The  strongest  support  for  the  evolutionary  relation¬ 
ships  among  the  four  species  tested  was  retrieved 
when  all  four  genes  were  combined  and  partitioned  as 
described  above.  Analyses  excluding  the  outgroup, 
presumably  more  independent  of  long  branch  effects, 
offer  our  best  hypothesis  for  the  ingroup  topology 
(Fig.  1).  In  summary,  four  major  evolutive  lines  were 
recovered.  Two  groups,  An.  albitarsis/An.  albitarsis  B 
and  An.  marajoara! An.  deaneorum,  were  usually  re¬ 
covered,  but  not  in  all  analyses.  The  latter  group  in¬ 
cludes  two  species  that  are  important  vectors  of  hu¬ 
man  Plasmodium  in  localities  in  the  Amazonas  region 
of  Brazil  (Klein  et  al.  1991a,  b;  Conn  et  al.  2002).  This 
suggests  a  possible  phylogenetic  link  associated  with 
the  ability  to  transmit  human  malaria  parasites.  Similar 
conclusions  have  been  made  for  An.  albimanus  and 
An.  darting i  (Conn  1998). 

When  the  mtDNA  position  3  partition  was  analyzed 
alone,  An,  deaneorum  specimen  D17  and  An.  mara¬ 
joara  specimen  CIO  clustered  together  sister  to  the 
remaining  An.  marajoara  individuals.  This  result  sug¬ 
gests  the  possibility  of  incomplete  lineage  sorting,  in- 
trogression  (Donnelly  et  al.  2004),  or  even  the  exis¬ 
tence  of  an  additional  taxon. 

Our  analysis  of  the  distribution  of  possible  roots 
used  sampled  trees  from  MCMC  runs,  including  the 
outgroup.  It  showed  that  An.  albitarsis  and  An.  albi¬ 
tarsis  B  are  in  an  ambiguous  position  with  relation  to 
the  root  but  that  (An.  marajoara.  An.  deaneorum) 
generally  formed  a  clade.  Therefore,  we  think  that  the 


root  is  either  as  shown  in  Fig.  2  or  as  evidenced  by  the 
large  number  of  MCMC  sampled  trees  rooting  there, 
either  at  the  base  of  the  An.  albitarsis  B  clade,  or  at  the 
base  of  the  (An.  marajoara,  An,  deaneorum)  clade.  A 
more  definitive  answer  to  this  question  requires  more 
data. 

Lehr  et  al.  (2005)  using  the  entire  COl  gene  se¬ 
quence  data  of  29  individuals  of  An.  albitarsis.  An. 
albitarsis  B,  An.  deaneorum,  and  An.  marajoara  recov¬ 
ered  results  similar  to  those  generated  in  the  current 
study  with  the  mtDNA  position  3  data  partition,  in  that 
there  was  nonexclusivity  of  the  An.  deaneorum  clade 
with  respect  to  An.  marajoara.  They  showed  the  ex¬ 
clusivity  of  the  sequences  of  An.  albitarsis,  An.  albi¬ 
tarsis  B,  and  a  sister  group  relationship  of  these  two 
taxa.  They  also  found  four  individuals  of  An.  marajoara 
that  fell  outside  the  remaining  sequences  of  the  Al¬ 
bitarsis  Complex  (in  Bayesian  topology),  which  they 
suggest  represents  a  fifth  species.  The  individuals  that 
were  used  to  generate  these  sequences  were  collected 
in  Roraima  State,  Boa  Vista,  Brazil,  and  Venezuela. 
Interestingly,  Lehr  et  al.  (2005)  also  recovered  a  non¬ 
exclusive  clade  consisting  of  individuals  of  An.  mara¬ 
joara  and  An.  deaneorum.  A  similar  grouping  was  found 
when  we  analyzed  the  mtDNA  data  partition  for  the 
current  study  (An.  marajoara  specimen  CIO  clustered 
with  An.  deaneonttn  specimen  D17).  Lack  of  exclu¬ 
sivity  of  sequences  of  An.  marajoara  and  An.  deaneo¬ 
rum  are  similar  to  our  results  and  are  also  suggestive 
of  ancestral  introgression  or  perhaps  a  recent  specia- 
tion  event  that  could  not  be  detected  by  partial  se¬ 
quences  of  the  mitochondrial  genes  COI  and  ND4. 

The  current  study  was  based  on  conclusions  about 
species  boundaries  reached  using  fixed  RAPD  markers 
(Wilkerson  et  al.  1995a,  b).  Our  results  corroborate 
the  RAPD  evidence  that  indicates  four  putative  spe¬ 
cies:  An.  albitarsis.  An.  albitarsis  B,  An.  marajoara,  and 
An.  deaneorum.  No  additional  taxa  were  detected.  This 
is  not  surprising  given  that  the  same  genetic  material 
was  used  in  both  studies.  The  existence  of  a  fifth 
species  as  reported  by  Lehr  et  al.  (2005)  in  Boa  Vista, 
Brazil,  was  not  directly  tested  by  us  using  sequence  as 
described  in  this  study.  However,  we  assumed  the  Boa 
Vista  specimens  to  be  An.  marajoara  based  on  com¬ 
parison  to  one  or  two  diagnostic  RAPD  markers  (data 
not  shown).  The  possibility  of  a  fifth  species  is  sup¬ 
ported  by  independent  data  sets:  Kreutzer  et  al. 
(1976)  (chromosomes),  Rosa-Freitas  et  al.  (1990) 
(isozymes),  and  possibly  Narang  et  al.  (1993)  (allo- 
zymes,  mtDNA  restriction  fragment  length  polymor¬ 
phisms).  However,  in  support  of  a  hypothesis  for  the 
existence  of  An.  marajoara  as  a  single  widespread 
species  is  an  extensive  data  set  of  rDNA  ITS2  sequence 
from  throughout  its  range,  and  taxon-specific  PCR 
primers  based  on  that  sequence  (Li  and  Wilkerson 
2005).  If  in  fact  there  is  a  fifth  member  of  this  complex, 
the  RAPD  results  should  be  revisited  because  assump¬ 
tions  about  the  wide  distribution  of  An.  marajoara 
(Venezuela  to  southern  Sao  Paulo  State)  were  based 
on  the  existence  of  seven  RAPD  markers  that  were 
found  in  nearly  all  individuals  tested  from  all  parts  of 
its  putative  range  (Wilkerson  et  al.  1995a,  b). 


924 


Annai.s  of  the  Entomological  Society  of  America 


Vol.  98,  no.  6 


The  topology  of  the  gene  tree  reported  by  Merritt 
et  al.  (2005),  who  used  coding  sequence  of  a  portion 
of  the  white  gene  containing  its  fourth  intron,  varied 
significantly  from  that  found  by  us  and  Lehr  et  al. 
(2005) .  They  found  good  statistical  support  for  a  single 
loss  of  the  fourth  intron  in  the  species  complex 
(present  in  An.  marajoara,  absent  in  the  other  spe¬ 
cies),  but  weak  evidence  that  An.  marajoara  is  basal 
relative  to  [An.  alhitarsis  B,  (An.  alhitarsis.  An.  deaneo- 
rtim )  1 .  The  alternative  topology  placed  An.  marajoara 
sister  to  An.  An.  alhitarsis  B.  There  was  high  support 
for  the  sister  relationship  of  An.  alhitarsis  and  An. 
dcaneorum.  This  is  in  contrast  to  our  results  that  give 
high  support  for  a  close  relationship  between  An.  al¬ 
hitarsis  and  An.  alhitarsis  B  and  between  An.  mara¬ 
joara  and  An.  deaneortim. 

The  above-mentioned  conflicting  results  will  cer¬ 
tainly  require  additional  data  to  resolve.  A  recent 
report  by  Besansky  et  al.  (2003)  addressed  the  issue  of 
conflictingdatasets  in  the  resolution  of  species  bound¬ 
aries  and  phylogenetic  relationships  in  the  An.  gnm- 
hiae  complex  that  may  be  germane  in  solving  the  issue 
of  An.  marajoara,  and  resolving  the  phylogenetic  re¬ 
lationships  of  the  Alhitarsis  Complex.  They  found  ev¬ 
idence  supporting  both  introgression  and  reproduc¬ 
tive  isolation  as  well  as  different  tree  topologies, 
depending  on  which  sequence  was  sampled.  They 
concluded  that  adoption  of  a  "total  evidence"  ap¬ 
proach  for  phylogenetic  analysis  of  closely  related 
species  runs  a  risk  of  recovering  a  highly  supported 
wrong  answer  and  suggested  that  at  the  level  of  closely 
related  species,  it  would  be  better  to  do  a  careful 
locus-by-locus  assessment  of  sequence  divergence 
rather  than  just  adopt  a  total  evidence  approach.  They 
were  able  to  use  various  genes  on  all  the  chromo¬ 
somes,  inside  and  out  of  inversions,  as  well  as  mito¬ 
chondrial  genes,  for  their  conclusions.  The  approach 
of  Besansky  et  al.  (2003)  provides  a  model  for  future 
research  into  the  phylogenetic  relationships  of  the 
Alhitarsis  Complex. 
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